Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 19926 |
| Missing cells | 284731 |
| Missing cells (%) | 57.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.8 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Text | 13 |
|---|---|
| Categorical | 5 |
| Numeric | 3 |
| Boolean | 4 |
original_amount is highly overall correlated with original_currency1 and 6 other fields | High correlation |
transaction_month is highly overall correlated with transaction_year and 1 other fields | High correlation |
transaction_year is highly overall correlated with transaction_month and 1 other fields | High correlation |
original_currency1 is highly overall correlated with original_amount and 5 other fields | High correlation |
trx_currency is highly overall correlated with transaction_month and 8 other fields | High correlation |
division is highly overall correlated with original_amount and 2 other fields | High correlation |
transaction_day_of_week is highly overall correlated with original_amount and 3 other fields | High correlation |
weekday_transaction is highly overall correlated with original_amount and 2 other fields | High correlation |
transaction_to_original_diff is highly overall correlated with original_amount and 3 other fields | High correlation |
currency_change is highly overall correlated with original_amount and 3 other fields | High correlation |
transaction_freq_gt_weekly is highly overall correlated with original_amount and 1 other fields | High correlation |
original_currency1 is highly imbalanced (93.2%) | Imbalance |
trx_currency is highly imbalanced (99.8%) | Imbalance |
division is highly imbalanced (53.4%) | Imbalance |
weekday_transaction is highly imbalanced (77.1%) | Imbalance |
transaction_to_original_diff is highly imbalanced (84.7%) | Imbalance |
currency_change is highly imbalanced (84.7%) | Imbalance |
transaction_freq_gt_weekly is highly imbalanced (99.8%) | Imbalance |
purpose has 12052 (60.5%) missing values | Missing |
merchant_name has 11854 (59.5%) missing values | Missing |
cost_center_wbls_element_order_description has 11865 (59.5%) missing values | Missing |
card_posting_date has 11854 (59.5%) missing values | Missing |
merchant_type_mcc has 11854 (59.5%) missing values | Missing |
merchant_type_description has 11854 (59.5%) missing values | Missing |
original_currency1 has 11855 (59.5%) missing values | Missing |
cost_center_wbls_element_order has 11857 (59.5%) missing values | Missing |
transaction_date has 11855 (59.5%) missing values | Missing |
transaction_amount has 11855 (59.5%) missing values | Missing |
trx_currency has 11855 (59.5%) missing values | Missing |
gl_account_description has 11855 (59.5%) missing values | Missing |
original_amount has 11855 (59.5%) missing values | Missing |
division has 11855 (59.5%) missing values | Missing |
gl_account has 11855 (59.5%) missing values | Missing |
batch_transaction_id has 11855 (59.5%) missing values | Missing |
transaction_gt_50 has 11855 (59.5%) missing values | Missing |
transaction_day_of_week has 11855 (59.5%) missing values | Missing |
transaction_month has 11856 (59.5%) missing values | Missing |
transaction_year has 11856 (59.5%) missing values | Missing |
weekday_transaction has 11856 (59.5%) missing values | Missing |
transaction_to_original_diff has 11856 (59.5%) missing values | Missing |
currency_change has 11856 (59.5%) missing values | Missing |
transaction_freq_gt_weekly has 11856 (59.5%) missing values | Missing |
original_amount is highly skewed (γ1 = 86.56852089) | Skewed |
Reproduction
| Analysis started | 2023-08-17 14:40:22.776978 |
|---|---|
| Analysis finished | 2023-08-17 14:40:30.534365 |
| Duration | 7.76 seconds |
| Software version | ydata-profiling vv4.5.1 |
| Download configuration | config.json |
Unnamed: 0
Text
| Distinct | 15941 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 155.8 KiB |
Length
| Max length | 324 |
|---|---|
| Median length | 309 |
| Mean length | 158.7844 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3163938 |
|---|---|
| Distinct characters | 79 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 13758 ? |
|---|---|
| Unique (%) | 69.0% |
Sample
| 1st row | 1003 |
|---|---|
| 2nd row | 1002 |
| 3rd row | 895 |
| 4th row | 896 |
| 5th row | 894 |
| Value | Count | Frequency (%) |
| 23174 | 12.0% | |
| recreation | 11858 | 6.1% |
| forestry | 11854 | 6.1% |
| and | 4028 | 2.1% |
| stores | 3432 | 1.8% |
| supply | 2627 | 1.4% |
| educational | 2568 | 1.3% |
| non-alcoholic | 2273 | 1.2% |
| for | 2233 | 1.2% |
| not | 1174 | 0.6% |
| Other values (71708) | 127848 |
Most occurring characters
| Value | Count | Frequency (%) |
| , | 306324 | 9.7% |
| 173179 | 5.5% | |
| 0 | 163740 | 5.2% |
| 2 | 160186 | 5.1% |
| 1 | 134751 | 4.3% |
| e | 111836 | 3.5% |
| R | 108212 | 3.4% |
| E | 107140 | 3.4% |
| A | 104196 | 3.3% |
| S | 94417 | 3.0% |
| Other values (69) | 1699957 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1108553 | |
| Decimal Number | 760869 | |
| Lowercase Letter | 633394 | |
| Other Punctuation | 407021 | 12.9% |
| Space Separator | 173191 | 5.5% |
| Dash Punctuation | 80380 | 2.5% |
| Open Punctuation | 346 | < 0.1% |
| Close Punctuation | 184 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 111836 | |
| s | 79178 | |
| a | 73377 | |
| l | 65129 | |
| r | 60411 | |
| o | 35019 | 5.5% |
| u | 33269 | 5.3% |
| t | 31370 | 5.0% |
| i | 22799 | 3.6% |
| n | 19310 | 3.0% |
| Other values (16) | 101696 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 108212 | 9.8% |
| E | 107140 | 9.7% |
| A | 104196 | 9.4% |
| S | 94417 | 8.5% |
| T | 78115 | 7.0% |
| C | 77309 | 7.0% |
| O | 70795 | 6.4% |
| F | 67999 | 6.1% |
| P | 63135 | 5.7% |
| I | 57406 | 5.2% |
| Other values (16) | 279829 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 306324 | |
| " | 37927 | 9.3% |
| . | 36170 | 8.9% |
| & | 21322 | 5.2% |
| * | 4380 | 1.1% |
| / | 707 | 0.2% |
| ' | 102 | < 0.1% |
| : | 70 | < 0.1% |
| @ | 7 | < 0.1% |
| # | 6 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 163740 | |
| 2 | 160186 | |
| 1 | 134751 | |
| 3 | 64625 | 8.5% |
| 5 | 46047 | 6.1% |
| 7 | 43378 | 5.7% |
| 4 | 41268 | 5.4% |
| 8 | 37448 | 4.9% |
| 9 | 36888 | 4.8% |
| 6 | 32538 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 173179 | ||
| Â | 12 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 80380 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 346 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1741947 | |
| Common | 1421991 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 111836 | 6.4% |
| R | 108212 | 6.2% |
| E | 107140 | 6.2% |
| A | 104196 | 6.0% |
| S | 94417 | 5.4% |
| s | 79178 | 4.5% |
| T | 78115 | 4.5% |
| C | 77309 | 4.4% |
| a | 73377 | 4.2% |
| O | 70795 | 4.1% |
| Other values (42) | 837372 |
Common
| Value | Count | Frequency (%) |
| , | 306324 | |
| 173179 | ||
| 0 | 163740 | |
| 2 | 160186 | |
| 1 | 134751 | |
| - | 80380 | 5.7% |
| 3 | 64625 | 4.5% |
| 5 | 46047 | 3.2% |
| 7 | 43378 | 3.1% |
| 4 | 41268 | 2.9% |
| Other values (17) | 208113 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3163926 | |
| None | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| , | 306324 | 9.7% |
| 173179 | 5.5% | |
| 0 | 163740 | 5.2% |
| 2 | 160186 | 5.1% |
| 1 | 134751 | 4.3% |
| e | 111836 | 3.5% |
| R | 108212 | 3.4% |
| E | 107140 | 3.4% |
| A | 104196 | 3.3% |
| S | 94417 | 3.0% |
| Other values (68) | 1699945 |
None
| Value | Count | Frequency (%) |
| Â | 12 |
purpose
Text
MISSING 
| Distinct | 5431 |
|---|---|
| Distinct (%) | 69.0% |
| Missing | 12052 |
| Missing (%) | 60.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 95 |
|---|---|
| Median length | 66 |
| Mean length | 19.664592 |
| Min length | 1 |
Characters and Unicode
| Total characters | 154839 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 4650 ? |
|---|---|
| Unique (%) | 59.1% |
Sample
| 1st row | Book: Canadian Urban Regions |
|---|---|
| 2nd row | update#4 to looseleaf publication:Planning&Zoning |
| 3rd row | LooseLeaf Publications updates |
| 4th row | update to looseleaf publication inv#10288391 |
| 5th row | Book: Rapid Graphs with Tableau Software 6 |
| Value | Count | Frequency (%) |
| for | 968 | 4.0% |
| more | 957 | 4.0% |
| to | 333 | 1.4% |
| 291 | 1.2% | |
| supplies | 267 | 1.1% |
| and | 237 | 1.0% |
| subscription | 215 | 0.9% |
| tweetymail | 211 | 0.9% |
| monthly | 209 | 0.9% |
| max | 206 | 0.9% |
| Other values (5092) | 20164 |
Most occurring characters
| Value | Count | Frequency (%) |
| 17335 | 11.2% | |
| E | 14083 | 9.1% |
| R | 10635 | 6.9% |
| T | 9488 | 6.1% |
| A | 9287 | 6.0% |
| S | 9247 | 6.0% |
| O | 8800 | 5.7% |
| I | 8244 | 5.3% |
| L | 6888 | 4.4% |
| N | 6588 | 4.3% |
| Other values (74) | 54244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 122774 | |
| Space Separator | 17335 | 11.2% |
| Other Punctuation | 5642 | 3.6% |
| Lowercase Letter | 4638 | 3.0% |
| Decimal Number | 3815 | 2.5% |
| Dash Punctuation | 482 | 0.3% |
| Open Punctuation | 71 | < 0.1% |
| Close Punctuation | 67 | < 0.1% |
| Connector Punctuation | 9 | < 0.1% |
| Math Symbol | 4 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 14083 | |
| R | 10635 | 8.7% |
| T | 9488 | 7.7% |
| A | 9287 | 7.6% |
| S | 9247 | 7.5% |
| O | 8800 | 7.2% |
| I | 8244 | 6.7% |
| L | 6888 | 5.6% |
| N | 6588 | 5.4% |
| C | 5521 | 4.5% |
| Other values (16) | 33993 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 556 | |
| a | 462 | 10.0% |
| r | 415 | 8.9% |
| o | 364 | 7.8% |
| t | 337 | 7.3% |
| s | 283 | 6.1% |
| l | 283 | 6.1% |
| n | 275 | 5.9% |
| i | 272 | 5.9% |
| c | 189 | 4.1% |
| Other values (16) | 1202 |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 3937 | |
| , | 646 | 11.4% |
| / | 467 | 8.3% |
| . | 213 | 3.8% |
| & | 157 | 2.8% |
| " | 131 | 2.3% |
| ' | 41 | 0.7% |
| # | 28 | 0.5% |
| : | 15 | 0.3% |
| @ | 4 | 0.1% |
| Other values (2) | 3 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 804 | |
| 2 | 590 | |
| 0 | 572 | |
| 4 | 444 | |
| 3 | 337 | |
| 5 | 305 | 8.0% |
| 8 | 269 | 7.1% |
| 6 | 252 | 6.6% |
| 7 | 139 | 3.6% |
| 9 | 103 | 2.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 70 | |
| [ | 1 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 | |
| = | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 17335 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 482 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 67 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 127412 | |
| Common | 27427 | 17.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 14083 | 11.1% |
| R | 10635 | 8.3% |
| T | 9488 | 7.4% |
| A | 9287 | 7.3% |
| S | 9247 | 7.3% |
| O | 8800 | 6.9% |
| I | 8244 | 6.5% |
| L | 6888 | 5.4% |
| N | 6588 | 5.2% |
| C | 5521 | 4.3% |
| Other values (42) | 38631 |
Common
| Value | Count | Frequency (%) |
| 17335 | ||
| * | 3937 | 14.4% |
| 1 | 804 | 2.9% |
| , | 646 | 2.4% |
| 2 | 590 | 2.2% |
| 0 | 572 | 2.1% |
| - | 482 | 1.8% |
| / | 467 | 1.7% |
| 4 | 444 | 1.6% |
| 3 | 337 | 1.2% |
| Other values (22) | 1813 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 154838 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 17335 | 11.2% | |
| E | 14083 | 9.1% |
| R | 10635 | 6.9% |
| T | 9488 | 6.1% |
| A | 9287 | 6.0% |
| S | 9247 | 6.0% |
| O | 8800 | 5.7% |
| I | 8244 | 5.3% |
| L | 6888 | 4.4% |
| N | 6588 | 4.3% |
| Other values (73) | 54243 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
merchant_name
Text
MISSING 
| Distinct | 1338 |
|---|---|
| Distinct (%) | 16.6% |
| Missing | 11854 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 16.675917 |
| Min length | 2 |
Characters and Unicode
| Total characters | 134608 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 697 ? |
|---|---|
| Unique (%) | 8.6% |
Sample
| 1st row | indigo books music |
|---|---|
| 2nd row | carswell |
| 3rd row | carswell |
| 4th row | rei lexisnexis canada |
| 5th row | createspace |
| Value | Count | Frequency (%) |
| home | 1260 | 5.6% |
| depot | 1259 | 5.6% |
| store | 857 | 3.8% |
| cdn | 749 | 3.3% |
| tire | 717 | 3.2% |
| supply | 369 | 1.6% |
| lowes | 261 | 1.2% |
| paypal | 243 | 1.1% |
| auto | 224 | 1.0% |
| tweetymail | 216 | 1.0% |
| Other values (1953) | 16328 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14411 | 10.7% | |
| e | 12700 | 9.4% |
| t | 9058 | 6.7% |
| o | 8393 | 6.2% |
| a | 8341 | 6.2% |
| s | 7609 | 5.7% |
| r | 6824 | 5.1% |
| i | 6469 | 4.8% |
| n | 6375 | 4.7% |
| l | 5586 | 4.1% |
| Other values (29) | 48842 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 105591 | |
| Decimal Number | 14604 | 10.8% |
| Space Separator | 14411 | 10.7% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12700 | |
| t | 9058 | 8.6% |
| o | 8393 | 7.9% |
| a | 8341 | 7.9% |
| s | 7609 | 7.2% |
| r | 6824 | 6.5% |
| i | 6469 | 6.1% |
| n | 6375 | 6.0% |
| l | 5586 | 5.3% |
| p | 5389 | 5.1% |
| Other values (16) | 28847 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4362 | |
| 1 | 2253 | |
| 7 | 2093 | |
| 2 | 1177 | 8.1% |
| 3 | 1040 | 7.1% |
| 4 | 902 | 6.2% |
| 6 | 801 | 5.5% |
| 5 | 719 | 4.9% |
| 9 | 704 | 4.8% |
| 8 | 553 | 3.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| W | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 14411 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 105593 | |
| Common | 29015 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 12700 | |
| t | 9058 | 8.6% |
| o | 8393 | 7.9% |
| a | 8341 | 7.9% |
| s | 7609 | 7.2% |
| r | 6824 | 6.5% |
| i | 6469 | 6.1% |
| n | 6375 | 6.0% |
| l | 5586 | 5.3% |
| p | 5389 | 5.1% |
| Other values (18) | 28849 |
Common
| Value | Count | Frequency (%) |
| 14411 | ||
| 0 | 4362 | 15.0% |
| 1 | 2253 | 7.8% |
| 7 | 2093 | 7.2% |
| 2 | 1177 | 4.1% |
| 3 | 1040 | 3.6% |
| 4 | 902 | 3.1% |
| 6 | 801 | 2.8% |
| 5 | 719 | 2.5% |
| 9 | 704 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 134608 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 14411 | 10.7% | |
| e | 12700 | 9.4% |
| t | 9058 | 6.7% |
| o | 8393 | 6.2% |
| a | 8341 | 6.2% |
| s | 7609 | 5.7% |
| r | 6824 | 5.1% |
| i | 6469 | 4.8% |
| n | 6375 | 4.7% |
| l | 5586 | 4.1% |
| Other values (29) | 48842 |
cost_center_wbls_element_order_description
Text
MISSING 
| Distinct | 185 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 11865 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 50 |
|---|---|
| Median length | 39 |
| Mean length | 33.473266 |
| Min length | 9 |
Characters and Unicode
| Total characters | 269828 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 36 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | HEAD OFF-POLICY&RESR |
|---|---|
| 2nd row | HEAD OFF-POLICY&RESR |
| 3rd row | HEAD OFF-POLICY&RESR |
| 4th row | HEAD OFF-POLICY&RESR |
| 5th row | HEAD OFF-POLICY&RESR |
| Value | Count | Frequency (%) |
| 7726 | 17.6% | |
| wtr | 2085 | 4.8% |
| treat | 2077 | 4.7% |
| supply | 1658 | 3.8% |
| roadway | 1447 | 3.3% |
| tp | 1317 | 3.0% |
| ww | 1139 | 2.6% |
| transmission | 1128 | 2.6% |
| ops | 1034 | 2.4% |
| treatmnt | 948 | 2.2% |
| Other values (286) | 23298 |
Most occurring characters
| Value | Count | Frequency (%) |
| 37195 | ||
| T | 23800 | 8.8% |
| R | 21486 | 8.0% |
| A | 20103 | 7.5% |
| E | 19556 | 7.2% |
| S | 19047 | 7.1% |
| O | 12836 | 4.8% |
| P | 11537 | 4.3% |
| N | 10700 | 4.0% |
| W | 9718 | 3.6% |
| Other values (33) | 83850 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 210820 | |
| Space Separator | 37195 | 13.8% |
| Other Punctuation | 12255 | 4.5% |
| Dash Punctuation | 5951 | 2.2% |
| Decimal Number | 3607 | 1.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 23800 | |
| R | 21486 | |
| A | 20103 | 9.5% |
| E | 19556 | 9.3% |
| S | 19047 | 9.0% |
| O | 12836 | 6.1% |
| P | 11537 | 5.5% |
| N | 10700 | 5.1% |
| W | 9718 | 4.6% |
| I | 9697 | 4.6% |
| Other values (16) | 52340 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2043 | |
| 4 | 698 | 19.4% |
| 2 | 432 | 12.0% |
| 3 | 418 | 11.6% |
| 0 | 7 | 0.2% |
| 6 | 4 | 0.1% |
| 5 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 5491 | |
| : | 5279 | |
| , | 1009 | 8.2% |
| / | 279 | 2.3% |
| . | 159 | 1.3% |
| ' | 38 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 37195 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5951 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 210820 | |
| Common | 59008 | 21.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 23800 | |
| R | 21486 | |
| A | 20103 | 9.5% |
| E | 19556 | 9.3% |
| S | 19047 | 9.0% |
| O | 12836 | 6.1% |
| P | 11537 | 5.5% |
| N | 10700 | 5.1% |
| W | 9718 | 4.6% |
| I | 9697 | 4.6% |
| Other values (16) | 52340 |
Common
| Value | Count | Frequency (%) |
| 37195 | ||
| - | 5951 | 10.1% |
| & | 5491 | 9.3% |
| : | 5279 | 8.9% |
| 1 | 2043 | 3.5% |
| , | 1009 | 1.7% |
| 4 | 698 | 1.2% |
| 2 | 432 | 0.7% |
| 3 | 418 | 0.7% |
| / | 279 | 0.5% |
| Other values (7) | 213 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 269828 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 37195 | ||
| T | 23800 | 8.8% |
| R | 21486 | 8.0% |
| A | 20103 | 7.5% |
| E | 19556 | 7.2% |
| S | 19047 | 7.1% |
| O | 12836 | 4.8% |
| P | 11537 | 4.3% |
| N | 10700 | 4.0% |
| W | 9718 | 3.6% |
| Other values (33) | 83850 |
MISSING 
| Distinct | 1436 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 11854 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9993806 |
| Min length | 5 |
Characters and Unicode
| Total characters | 80715 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 555 ? |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 2011-06-27 |
|---|---|
| 2nd row | 2011-06-08 |
| 3rd row | 2011-05-24 |
| 4th row | 2011-05-24 |
| 5th row | 2011-05-16 |
| Value | Count | Frequency (%) |
| 2018-04-26 | 38 | 0.5% |
| 2017-05-23 | 38 | 0.5% |
| 2018-09-20 | 38 | 0.5% |
| 2018-05-03 | 37 | 0.5% |
| 2018-01-11 | 36 | 0.4% |
| 2017-10-02 | 34 | 0.4% |
| 2017-05-29 | 34 | 0.4% |
| 2018-05-17 | 33 | 0.4% |
| 2018-04-05 | 32 | 0.4% |
| 2018-01-10 | 32 | 0.4% |
| Other values (1426) | 7720 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 17780 | |
| - | 16142 | |
| 1 | 15680 | |
| 2 | 13069 | |
| 8 | 5194 | 6.4% |
| 7 | 4054 | 5.0% |
| 6 | 1996 | 2.5% |
| 3 | 1924 | 2.4% |
| 5 | 1886 | 2.3% |
| 4 | 1594 | 2.0% |
| Other values (2) | 1396 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64572 | |
| Dash Punctuation | 16142 | 20.0% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 17780 | |
| 1 | 15680 | |
| 2 | 13069 | |
| 8 | 5194 | 8.0% |
| 7 | 4054 | 6.3% |
| 6 | 1996 | 3.1% |
| 3 | 1924 | 3.0% |
| 5 | 1886 | 2.9% |
| 4 | 1594 | 2.5% |
| 9 | 1395 | 2.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16142 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 80715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 17780 | |
| - | 16142 | |
| 1 | 15680 | |
| 2 | 13069 | |
| 8 | 5194 | 6.4% |
| 7 | 4054 | 5.0% |
| 6 | 1996 | 2.5% |
| 3 | 1924 | 2.4% |
| 5 | 1886 | 2.3% |
| 4 | 1594 | 2.0% |
| Other values (2) | 1396 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 17780 | |
| - | 16142 | |
| 1 | 15680 | |
| 2 | 13069 | |
| 8 | 5194 | 6.4% |
| 7 | 4054 | 5.0% |
| 6 | 1996 | 2.5% |
| 3 | 1924 | 2.4% |
| 5 | 1886 | 2.3% |
| 4 | 1594 | 2.0% |
| Other values (2) | 1396 | 1.7% |
MISSING 
| Distinct | 164 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 11854 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.9982656 |
| Min length | 3 |
Characters and Unicode
| Total characters | 48418 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 5399.0 |
|---|---|
| 2nd row | 7338.0 |
| 3rd row | 7338.0 |
| 4th row | 5942.0 |
| 5th row | 7829.0 |
| Value | Count | Frequency (%) |
| 5200.0 | 2235 | |
| 5085.0 | 843 | 10.4% |
| 5251.0 | 567 | 7.0% |
| 9399.0 | 298 | 3.7% |
| 5211.0 | 284 | 3.5% |
| 7372.0 | 235 | 2.9% |
| 5943.0 | 190 | 2.4% |
| 5732.0 | 181 | 2.2% |
| 5261.0 | 175 | 2.2% |
| 4812.0 | 155 | 1.9% |
| Other values (154) | 2909 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14272 | |
| 5 | 8102 | |
| . | 8071 | |
| 2 | 4768 | 9.8% |
| 1 | 3042 | 6.3% |
| 9 | 2541 | 5.2% |
| 3 | 2076 | 4.3% |
| 7 | 1744 | 3.6% |
| 8 | 1643 | 3.4% |
| 4 | 1303 | 2.7% |
| Other values (4) | 856 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40344 | |
| Other Punctuation | 8071 | 16.7% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 14272 | |
| 5 | 8102 | |
| 2 | 4768 | 11.8% |
| 1 | 3042 | 7.5% |
| 9 | 2541 | 6.3% |
| 3 | 2076 | 5.1% |
| 7 | 1744 | 4.3% |
| 8 | 1643 | 4.1% |
| 4 | 1303 | 3.2% |
| 6 | 853 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| A | 1 | |
| D | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8071 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 48415 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 14272 | |
| 5 | 8102 | |
| . | 8071 | |
| 2 | 4768 | 9.8% |
| 1 | 3042 | 6.3% |
| 9 | 2541 | 5.2% |
| 3 | 2076 | 4.3% |
| 7 | 1744 | 3.6% |
| 8 | 1643 | 3.4% |
| 4 | 1303 | 2.7% |
Latin
| Value | Count | Frequency (%) |
| C | 1 | |
| A | 1 | |
| D | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48418 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 14272 | |
| 5 | 8102 | |
| . | 8071 | |
| 2 | 4768 | 9.8% |
| 1 | 3042 | 6.3% |
| 9 | 2541 | 5.2% |
| 3 | 2076 | 4.3% |
| 7 | 1744 | 3.6% |
| 8 | 1643 | 3.4% |
| 4 | 1303 | 2.7% |
| Other values (4) | 856 | 1.8% |
MISSING 
| Distinct | 164 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 11854 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 257 |
|---|---|
| Median length | 40 |
| Mean length | 29.34502 |
| Min length | 5 |
Characters and Unicode
| Total characters | 236873 |
|---|---|
| Distinct characters | 68 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Miscellaneous General Merchandise |
|---|---|
| 2nd row | Quick Copy, Reproduction, and Blueprinti |
| 3rd row | Quick Copy, Reproduction, and Blueprinti |
| 4th row | Book Stores |
| 5th row | Motion Picture and Video Tape Production |
| Value | Count | Frequency (%) |
| supply | 2629 | 8.4% |
| home | 2309 | 7.4% |
| warehouse | 2235 | 7.1% |
| stores | 1633 | 5.2% |
| not | 1512 | 4.8% |
| elsewhere | 1465 | 4.7% |
| and | 1452 | 4.6% |
| supplies | 977 | 3.1% |
| classi | 918 | 2.9% |
| industrial | 843 | 2.7% |
| Other values (368) | 15299 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 26935 | 11.4% |
| 23203 | 9.8% | |
| r | 15835 | 6.7% |
| s | 14997 | 6.3% |
| a | 13642 | 5.8% |
| o | 13204 | 5.6% |
| l | 11677 | 4.9% |
| t | 11504 | 4.9% |
| i | 11385 | 4.8% |
| u | 10135 | 4.3% |
| Other values (58) | 84356 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 180751 | |
| Uppercase Letter | 29210 | 12.3% |
| Space Separator | 23203 | 9.8% |
| Other Punctuation | 2747 | 1.2% |
| Dash Punctuation | 822 | 0.3% |
| Open Punctuation | 82 | < 0.1% |
| Decimal Number | 56 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26935 | |
| r | 15835 | 8.8% |
| s | 14997 | 8.3% |
| a | 13642 | 7.5% |
| o | 13204 | 7.3% |
| l | 11677 | 6.5% |
| t | 11504 | 6.4% |
| i | 11385 | 6.3% |
| u | 10135 | 5.6% |
| p | 9214 | 5.1% |
| Other values (15) | 42223 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 8154 | |
| H | 3064 | 10.5% |
| C | 2879 | 9.9% |
| E | 2559 | 8.8% |
| W | 2491 | 8.5% |
| N | 1650 | 5.6% |
| I | 1374 | 4.7% |
| P | 1034 | 3.5% |
| G | 988 | 3.4% |
| M | 880 | 3.0% |
| Other values (14) | 4137 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 16 | |
| 1 | 13 | |
| 0 | 11 | |
| 4 | 5 | 8.9% |
| 5 | 3 | 5.4% |
| 7 | 3 | 5.4% |
| 8 | 3 | 5.4% |
| 6 | 1 | 1.8% |
| 3 | 1 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2587 | |
| / | 130 | 4.7% |
| & | 21 | 0.8% |
| " | 4 | 0.1% |
| . | 3 | 0.1% |
| ' | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 23203 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 822 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 82 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 209961 | |
| Common | 26912 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 26935 | 12.8% |
| r | 15835 | 7.5% |
| s | 14997 | 7.1% |
| a | 13642 | 6.5% |
| o | 13204 | 6.3% |
| l | 11677 | 5.6% |
| t | 11504 | 5.5% |
| i | 11385 | 5.4% |
| u | 10135 | 4.8% |
| p | 9214 | 4.4% |
| Other values (39) | 71433 |
Common
| Value | Count | Frequency (%) |
| 23203 | ||
| , | 2587 | 9.6% |
| - | 822 | 3.1% |
| / | 130 | 0.5% |
| ( | 82 | 0.3% |
| & | 21 | 0.1% |
| 2 | 16 | 0.1% |
| 1 | 13 | < 0.1% |
| 0 | 11 | < 0.1% |
| 4 | 5 | < 0.1% |
| Other values (9) | 22 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 236873 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 26935 | 11.4% |
| 23203 | 9.8% | |
| r | 15835 | 6.7% |
| s | 14997 | 6.3% |
| a | 13642 | 5.8% |
| o | 13204 | 5.6% |
| l | 11677 | 4.9% |
| t | 11504 | 4.9% |
| i | 11385 | 4.8% |
| u | 10135 | 4.3% |
| Other values (58) | 84356 |
original_currency1
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| CAD | |
|---|---|
| USD | 175 |
| CHF | 2 |
| GBP | 1 |
| 34.23 | 1 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.0002478 |
| Min length | 3 |
Characters and Unicode
| Total characters | 24215 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CAD |
|---|---|
| 2nd row | CAD |
| 3rd row | CAD |
| 4th row | CAD |
| 5th row | USD |
Common Values
| Value | Count | Frequency (%) |
| CAD | 7892 | |
| USD | 175 | 0.9% |
| CHF | 2 | < 0.1% |
| GBP | 1 | < 0.1% |
| 34.23 | 1 | < 0.1% |
| (Missing) | 11855 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cad | 7892 | |
| usd | 175 | 2.2% |
| chf | 2 | < 0.1% |
| gbp | 1 | < 0.1% |
| 34.23 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 8067 | |
| C | 7894 | |
| A | 7892 | |
| U | 175 | 0.7% |
| S | 175 | 0.7% |
| H | 2 | < 0.1% |
| F | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| G | 1 | < 0.1% |
| B | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 24210 | |
| Decimal Number | 4 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 8067 | |
| C | 7894 | |
| A | 7892 | |
| U | 175 | 0.7% |
| S | 175 | 0.7% |
| H | 2 | < 0.1% |
| F | 2 | < 0.1% |
| G | 1 | < 0.1% |
| B | 1 | < 0.1% |
| P | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 4 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24210 | |
| Common | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 8067 | |
| C | 7894 | |
| A | 7892 | |
| U | 175 | 0.7% |
| S | 175 | 0.7% |
| H | 2 | < 0.1% |
| F | 2 | < 0.1% |
| G | 1 | < 0.1% |
| B | 1 | < 0.1% |
| P | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 4 | 1 | |
| . | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24215 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 8067 | |
| C | 7894 | |
| A | 7892 | |
| U | 175 | 0.7% |
| S | 175 | 0.7% |
| H | 2 | < 0.1% |
| F | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| G | 1 | < 0.1% |
| B | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
cost_center_wbls_element_order
Text
MISSING 
| Distinct | 133 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 11857 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.039534 |
| Min length | 5 |
Characters and Unicode
| Total characters | 48733 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 25 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | UR0005 |
|---|---|
| 2nd row | UR0005 |
| 3rd row | UR0005 |
| 4th row | UR0005 |
| 5th row | UR0005 |
| Value | Count | Frequency (%) |
| tw7070 | 952 | 11.8% |
| tp0219 | 582 | 7.2% |
| tw7072 | 425 | 5.3% |
| tw2040 | 335 | 4.2% |
| tw2060 | 329 | 4.1% |
| tw4080 | 318 | 3.9% |
| tp0333 | 296 | 3.7% |
| tw2035 | 292 | 3.6% |
| tp0124 | 274 | 3.4% |
| tw7075 | 252 | 3.1% |
| Other values (124) | 4015 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12169 | |
| T | 7958 | |
| W | 5350 | |
| 7 | 4087 | 8.4% |
| 2 | 3729 | 7.7% |
| P | 2777 | 5.7% |
| 1 | 2549 | 5.2% |
| 4 | 2385 | 4.9% |
| 5 | 2262 | 4.6% |
| 3 | 1858 | 3.8% |
| Other values (13) | 3609 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32377 | |
| Uppercase Letter | 16235 | |
| Dash Punctuation | 105 | 0.2% |
| Other Punctuation | 15 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12169 | |
| 7 | 4087 | 12.6% |
| 2 | 3729 | 11.5% |
| 1 | 2549 | 7.9% |
| 4 | 2385 | 7.4% |
| 5 | 2262 | 7.0% |
| 3 | 1858 | 5.7% |
| 6 | 1551 | 4.8% |
| 9 | 1184 | 3.7% |
| 8 | 603 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 7958 | |
| W | 5350 | |
| P | 2777 | 17.1% |
| C | 91 | 0.6% |
| R | 29 | 0.2% |
| U | 21 | 0.1% |
| N | 4 | < 0.1% |
| O | 3 | < 0.1% |
| A | 1 | < 0.1% |
| E | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105 |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 15 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32498 | |
| Latin | 16235 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12169 | |
| 7 | 4087 | 12.6% |
| 2 | 3729 | 11.5% |
| 1 | 2549 | 7.8% |
| 4 | 2385 | 7.3% |
| 5 | 2262 | 7.0% |
| 3 | 1858 | 5.7% |
| 6 | 1551 | 4.8% |
| 9 | 1184 | 3.6% |
| 8 | 603 | 1.9% |
| Other values (3) | 121 | 0.4% |
Latin
| Value | Count | Frequency (%) |
| T | 7958 | |
| W | 5350 | |
| P | 2777 | 17.1% |
| C | 91 | 0.6% |
| R | 29 | 0.2% |
| U | 21 | 0.1% |
| N | 4 | < 0.1% |
| O | 3 | < 0.1% |
| A | 1 | < 0.1% |
| E | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48733 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12169 | |
| T | 7958 | |
| W | 5350 | |
| 7 | 4087 | 8.4% |
| 2 | 3729 | 7.7% |
| P | 2777 | 5.7% |
| 1 | 2549 | 5.2% |
| 4 | 2385 | 4.9% |
| 5 | 2262 | 4.6% |
| 3 | 1858 | 3.8% |
| Other values (13) | 3609 | 7.4% |
transaction_date
Text
MISSING 
| Distinct | 1523 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9992566 |
| Min length | 4 |
Characters and Unicode
| Total characters | 80704 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 618 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | 2011-06-26 |
|---|---|
| 2nd row | 2011-06-07 |
| 3rd row | 2011-05-20 |
| 4th row | 2011-05-20 |
| 5th row | 2011-05-14 |
| Value | Count | Frequency (%) |
| 2018-01-09 | 32 | 0.4% |
| 2017-07-11 | 32 | 0.4% |
| 2018-01-05 | 30 | 0.4% |
| 2017-06-05 | 29 | 0.4% |
| 2017-11-14 | 29 | 0.4% |
| 2018-09-20 | 28 | 0.3% |
| 2018-03-14 | 28 | 0.3% |
| 2017-06-14 | 27 | 0.3% |
| 2017-08-17 | 26 | 0.3% |
| 2018-02-02 | 26 | 0.3% |
| Other values (1513) | 7784 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 17909 | |
| - | 16140 | |
| 1 | 15638 | |
| 2 | 13015 | |
| 8 | 5162 | 6.4% |
| 7 | 4118 | 5.1% |
| 6 | 1970 | 2.4% |
| 5 | 1846 | 2.3% |
| 3 | 1814 | 2.2% |
| 4 | 1632 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64564 | |
| Dash Punctuation | 16140 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 17909 | |
| 1 | 15638 | |
| 2 | 13015 | |
| 8 | 5162 | 8.0% |
| 7 | 4118 | 6.4% |
| 6 | 1970 | 3.1% |
| 5 | 1846 | 2.9% |
| 3 | 1814 | 2.8% |
| 4 | 1632 | 2.5% |
| 9 | 1460 | 2.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 80704 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 17909 | |
| - | 16140 | |
| 1 | 15638 | |
| 2 | 13015 | |
| 8 | 5162 | 6.4% |
| 7 | 4118 | 5.1% |
| 6 | 1970 | 2.4% |
| 5 | 1846 | 2.3% |
| 3 | 1814 | 2.2% |
| 4 | 1632 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80704 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 17909 | |
| - | 16140 | |
| 1 | 15638 | |
| 2 | 13015 | |
| 8 | 5162 | 6.4% |
| 7 | 4118 | 5.1% |
| 6 | 1970 | 2.4% |
| 5 | 1846 | 2.3% |
| 3 | 1814 | 2.2% |
| 4 | 1632 | 2.0% |
MISSING 
| Distinct | 5653 |
|---|---|
| Distinct (%) | 70.0% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 5.3402305 |
| Min length | 3 |
Characters and Unicode
| Total characters | 43101 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4640 ? |
|---|---|
| Unique (%) | 57.5% |
Sample
| 1st row | 60.85 |
|---|---|
| 2nd row | 273.45 |
| 3rd row | 281.85 |
| 4th row | 91.3 |
| 5th row | 45.82 |
| Value | Count | Frequency (%) |
| 145.0 | 56 | 0.7% |
| 175.0 | 42 | 0.5% |
| 50.0 | 35 | 0.4% |
| 140.0 | 29 | 0.4% |
| 40.0 | 26 | 0.3% |
| 33.89 | 24 | 0.3% |
| 95.31 | 21 | 0.3% |
| 45.19 | 18 | 0.2% |
| 248.6 | 18 | 0.2% |
| 56.49 | 18 | 0.2% |
| Other values (5643) | 7784 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 8070 | |
| 1 | 5132 | |
| 2 | 4149 | |
| 5 | 3718 | |
| 4 | 3593 | |
| 3 | 3584 | |
| 6 | 3233 | |
| 7 | 3054 | 7.1% |
| 8 | 3046 | 7.1% |
| 9 | 3025 | 7.0% |
| Other values (2) | 2497 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 35030 | |
| Other Punctuation | 8070 | 18.7% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 2 | 4149 | |
| 5 | 3718 | |
| 4 | 3593 | |
| 3 | 3584 | |
| 6 | 3233 | |
| 7 | 3054 | |
| 8 | 3046 | |
| 9 | 3025 | |
| 0 | 2496 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8070 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 43101 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 8070 | |
| 1 | 5132 | |
| 2 | 4149 | |
| 5 | 3718 | |
| 4 | 3593 | |
| 3 | 3584 | |
| 6 | 3233 | |
| 7 | 3054 | 7.1% |
| 8 | 3046 | 7.1% |
| 9 | 3025 | 7.0% |
| Other values (2) | 2497 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43101 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 8070 | |
| 1 | 5132 | |
| 2 | 4149 | |
| 5 | 3718 | |
| 4 | 3593 | |
| 3 | 3584 | |
| 6 | 3233 | |
| 7 | 3054 | 7.1% |
| 8 | 3046 | 7.1% |
| 9 | 3025 | 7.0% |
| Other values (2) | 2497 | 5.8% |
trx_currency
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| CAD | |
|---|---|
| False | 1 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.0002478 |
| Min length | 3 |
Characters and Unicode
| Total characters | 24215 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CAD |
|---|---|
| 2nd row | CAD |
| 3rd row | CAD |
| 4th row | CAD |
| 5th row | CAD |
Common Values
| Value | Count | Frequency (%) |
| CAD | 8070 | |
| False | 1 | < 0.1% |
| (Missing) | 11855 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cad | 8070 | |
| false | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 8070 | |
| A | 8070 | |
| D | 8070 | |
| F | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| s | 1 | < 0.1% |
| e | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 24211 | |
| Lowercase Letter | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 8070 | |
| A | 8070 | |
| D | 8070 | |
| F | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24215 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 8070 | |
| A | 8070 | |
| D | 8070 | |
| F | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| s | 1 | < 0.1% |
| e | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24215 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 8070 | |
| A | 8070 | |
| D | 8070 | |
| F | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| s | 1 | < 0.1% |
| e | 1 | < 0.1% |
MISSING 
| Distinct | 165 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 40 |
|---|---|
| Median length | 38 |
| Mean length | 21.514434 |
| Min length | 1 |
Characters and Unicode
| Total characters | 173643 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 34 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | BOOK & MAGAZINE SUBSCRIPTIONS |
|---|---|
| 2nd row | BOOK & MAGAZINE SUBSCRIPTIONS |
| 3rd row | BOOK & MAGAZINE SUBSCRIPTIONS |
| 4th row | BOOK & MAGAZINE SUBSCRIPTIONS |
| 5th row | BOOK & MAGAZINE SUBSCRIPTIONS |
| Value | Count | Frequency (%) |
| 4523 | ||
| supplies | 2113 | 8.4% |
| general | 1814 | 7.2% |
| hardware | 1749 | 6.9% |
| materials | 1147 | 4.5% |
| equipment | 838 | 3.3% |
| parts | 689 | 2.7% |
| machinery | 670 | 2.7% |
| miscellaneous | 646 | 2.6% |
| m | 495 | 2.0% |
| Other values (238) | 10586 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 22000 | |
| 17199 | 9.9% | |
| A | 14461 | 8.3% |
| R | 13710 | 7.9% |
| S | 12580 | 7.2% |
| I | 11834 | 6.8% |
| L | 9812 | 5.7% |
| N | 9696 | 5.6% |
| P | 7881 | 4.5% |
| T | 7566 | 4.4% |
| Other values (26) | 46904 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 151503 | |
| Space Separator | 17199 | 9.9% |
| Other Punctuation | 3028 | 1.7% |
| Dash Punctuation | 1858 | 1.1% |
| Open Punctuation | 27 | < 0.1% |
| Close Punctuation | 27 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 22000 | |
| A | 14461 | 9.5% |
| R | 13710 | 9.0% |
| S | 12580 | 8.3% |
| I | 11834 | 7.8% |
| L | 9812 | 6.5% |
| N | 9696 | 6.4% |
| P | 7881 | 5.2% |
| T | 7566 | 5.0% |
| M | 5483 | 3.6% |
| Other values (16) | 36480 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 2845 | |
| / | 126 | 4.2% |
| , | 36 | 1.2% |
| . | 20 | 0.7% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 17199 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1858 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 27 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 27 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 151503 | |
| Common | 22140 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 22000 | |
| A | 14461 | 9.5% |
| R | 13710 | 9.0% |
| S | 12580 | 8.3% |
| I | 11834 | 7.8% |
| L | 9812 | 6.5% |
| N | 9696 | 6.4% |
| P | 7881 | 5.2% |
| T | 7566 | 5.0% |
| M | 5483 | 3.6% |
| Other values (16) | 36480 |
Common
| Value | Count | Frequency (%) |
| 17199 | ||
| & | 2845 | 12.9% |
| - | 1858 | 8.4% |
| / | 126 | 0.6% |
| , | 36 | 0.2% |
| ( | 27 | 0.1% |
| ) | 27 | 0.1% |
| . | 20 | 0.1% |
| 0 | 1 | < 0.1% |
| # | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 173643 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 22000 | |
| 17199 | 9.9% | |
| A | 14461 | 8.3% |
| R | 13710 | 7.9% |
| S | 12580 | 7.2% |
| I | 11834 | 6.8% |
| L | 9812 | 5.7% |
| N | 9696 | 5.6% |
| P | 7881 | 4.5% |
| T | 7566 | 4.4% |
| Other values (26) | 46904 |
original_amount
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 5607 |
|---|---|
| Distinct (%) | 69.5% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 250.9653 |
| Minimum | 0.01 |
|---|---|
| Maximum | 201807 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 155.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 8.395 |
| Q1 | 41.595 |
| median | 107.33 |
| Q3 | 248.6 |
| 95-th percentile | 860.915 |
| Maximum | 201807 |
| Range | 201806.99 |
| Interquartile range (IQR) | 207.005 |
Descriptive statistics
| Standard deviation | 2271.8371 |
|---|---|
| Coefficient of variation (CV) | 9.0523954 |
| Kurtosis | 7679.8877 |
| Mean | 250.9653 |
| Median Absolute Deviation (MAD) | 77.63 |
| Skewness | 86.568521 |
| Sum | 2025540.9 |
| Variance | 5161243.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 145 | 56 | 0.3% |
| 50 | 48 | 0.2% |
| 175 | 44 | 0.2% |
| 140 | 29 | 0.1% |
| 40 | 28 | 0.1% |
| 33.89 | 24 | 0.1% |
| 95.31 | 21 | 0.1% |
| 45.19 | 18 | 0.1% |
| 248.6 | 18 | 0.1% |
| 56.49 | 18 | 0.1% |
| Other values (5597) | 7767 | |
| (Missing) | 11855 |
| Value | Count | Frequency (%) |
| 0.01 | 1 | |
| 0.49 | 1 | |
| 0.66 | 1 | |
| 0.78 | 1 | |
| 0.86 | 1 | |
| 0.96 | 1 | |
| 1.07 | 1 | |
| 1.2 | 1 | |
| 1.23 | 1 | |
| 1.32 | 1 |
| Value | Count | Frequency (%) |
| 201807 | 1 | |
| 4013.52 | 1 | |
| 3288.58 | 1 | |
| 2998.17 | 1 | |
| 2988.75 | 1 | |
| 2959.72 | 1 | |
| 2956.36 | 1 | |
| 2948.74 | 2 | |
| 2943.65 | 1 | |
| 2932.35 | 1 |
division
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| TORONTO WATER | |
|---|---|
| TRANSPORTATION SERVICES | |
| TRANSPORTATION | |
| URBAN PLANNING | 21 |
| TREASURER | 6 |
Length
| Max length | 24 |
|---|---|
| Median length | 13 |
| Mean length | 16.109156 |
| Min length | 4 |
Characters and Unicode
| Total characters | 130017 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | URBAN PLANNING |
|---|---|
| 2nd row | URBAN PLANNING |
| 3rd row | URBAN PLANNING |
| 4th row | URBAN PLANNING |
| 5th row | URBAN PLANNING |
Common Values
| Value | Count | Frequency (%) |
| TORONTO WATER | 5277 | |
| TRANSPORTATION SERVICES | 2234 | 11.2% |
| TRANSPORTATION | 532 | 2.7% |
| URBAN PLANNING | 21 | 0.1% |
| TREASURER | 6 | < 0.1% |
| 2018 | 1 | < 0.1% |
| (Missing) | 11855 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| toronto | 5277 | |
| water | 5277 | |
| transportation | 2766 | |
| services | 2234 | |
| urban | 21 | 0.1% |
| planning | 21 | 0.1% |
| treasurer | 6 | < 0.1% |
| 2018 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 24135 | |
| O | 21363 | |
| R | 18359 | |
| N | 10893 | |
| A | 10857 | |
| 9766 | ||
| E | 9757 | |
| S | 7240 | 5.6% |
| W | 5277 | 4.1% |
| I | 5021 | 3.9% |
| Other values (11) | 7349 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 120247 | |
| Space Separator | 9766 | 7.5% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 24135 | |
| O | 21363 | |
| R | 18359 | |
| N | 10893 | |
| A | 10857 | |
| E | 9757 | |
| S | 7240 | 6.0% |
| W | 5277 | 4.4% |
| I | 5021 | 4.2% |
| P | 2787 | 2.3% |
| Other values (6) | 4558 | 3.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 0 | 1 | |
| 1 | 1 | |
| 8 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 9766 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 120247 | |
| Common | 9770 | 7.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 24135 | |
| O | 21363 | |
| R | 18359 | |
| N | 10893 | |
| A | 10857 | |
| E | 9757 | |
| S | 7240 | 6.0% |
| W | 5277 | 4.4% |
| I | 5021 | 4.2% |
| P | 2787 | 2.3% |
| Other values (6) | 4558 | 3.8% |
Common
| Value | Count | Frequency (%) |
| 9766 | ||
| 2 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 130017 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 24135 | |
| O | 21363 | |
| R | 18359 | |
| N | 10893 | |
| A | 10857 | |
| 9766 | ||
| E | 9757 | |
| S | 7240 | 5.6% |
| W | 5277 | 4.1% |
| I | 5021 | 3.9% |
| Other values (11) | 7349 | 5.7% |
gl_account
Text
MISSING 
| Distinct | 214 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.1058109 |
| Min length | 4 |
Characters and Unicode
| Total characters | 33138 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 54 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 2020.0 |
|---|---|
| 2nd row | 2020.0 |
| 3rd row | 2020.0 |
| 4th row | 2020.0 |
| 5th row | 2020.0 |
| Value | Count | Frequency (%) |
| 2710 | 1563 | |
| 2120 | 594 | 7.4% |
| 2999 | 567 | 7.0% |
| 2552 | 419 | 5.2% |
| 3080 | 313 | 3.9% |
| 2535 | 302 | 3.7% |
| 2530 | 273 | 3.4% |
| 2575 | 243 | 3.0% |
| 216 | 2.7% | |
| 2715 | 192 | 2.4% |
| Other values (204) | 3389 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 7238 | |
| 0 | 5901 | |
| 1 | 3631 | |
| 5 | 3480 | |
| 9 | 2814 | 8.5% |
| 4 | 2592 | 7.8% |
| 7 | 2501 | 7.5% |
| 3 | 1969 | 5.9% |
| * | 1080 | 3.3% |
| 8 | 1036 | 3.1% |
| Other values (6) | 896 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 31735 | |
| Other Punctuation | 1399 | 4.2% |
| Lowercase Letter | 3 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7238 | |
| 0 | 5901 | |
| 1 | 3631 | |
| 5 | 3480 | |
| 9 | 2814 | 8.9% |
| 4 | 2592 | 8.2% |
| 7 | 2501 | 7.9% |
| 3 | 1969 | 6.2% |
| 8 | 1036 | 3.3% |
| 6 | 573 | 1.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1 | |
| u | 1 | |
| e | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 1080 | |
| . | 319 | 22.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 33134 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 7238 | |
| 0 | 5901 | |
| 1 | 3631 | |
| 5 | 3480 | |
| 9 | 2814 | 8.5% |
| 4 | 2592 | 7.8% |
| 7 | 2501 | 7.5% |
| 3 | 1969 | 5.9% |
| * | 1080 | 3.3% |
| 8 | 1036 | 3.1% |
| Other values (2) | 892 | 2.7% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33138 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 7238 | |
| 0 | 5901 | |
| 1 | 3631 | |
| 5 | 3480 | |
| 9 | 2814 | 8.5% |
| 4 | 2592 | 7.8% |
| 7 | 2501 | 7.5% |
| 3 | 1969 | 5.9% |
| * | 1080 | 3.3% |
| 8 | 1036 | 3.1% |
| Other values (6) | 896 | 2.7% |
MISSING 
| Distinct | 8063 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.441581 |
| Min length | 5 |
Characters and Unicode
| Total characters | 60061 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8055 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | 1521-44 |
|---|---|
| 2nd row | 1494-34 |
| 3rd row | 1471-36 |
| 4th row | 1471-37 |
| 5th row | 1461-33 |
| Value | Count | Frequency (%) |
| 5020-56 | 2 | < 0.1% |
| 5020-46 | 2 | < 0.1% |
| 2481-8 | 2 | < 0.1% |
| 2560-310 | 2 | < 0.1% |
| 4228-237 | 2 | < 0.1% |
| 4228-81 | 2 | < 0.1% |
| 4228-139 | 2 | < 0.1% |
| 5021-121 | 2 | < 0.1% |
| 1365-34 | 1 | < 0.1% |
| 1329-39 | 1 | < 0.1% |
| Other values (8053) | 8053 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 8348 | |
| - | 8070 | |
| 1 | 6864 | |
| 5 | 5615 | |
| 2 | 4916 | |
| 9 | 4708 | |
| 3 | 4560 | |
| 0 | 4397 | |
| 7 | 4302 | |
| 8 | 4273 | |
| Other values (6) | 4008 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 51986 | |
| Dash Punctuation | 8070 | 13.4% |
| Lowercase Letter | 4 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 8348 | |
| 1 | 6864 | |
| 5 | 5615 | |
| 2 | 4916 | |
| 9 | 4708 | |
| 3 | 4560 | |
| 0 | 4397 | |
| 7 | 4302 | |
| 8 | 4273 | |
| 6 | 4003 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8070 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 60056 | |
| Latin | 5 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 8348 | |
| - | 8070 | |
| 1 | 6864 | |
| 5 | 5615 | |
| 2 | 4916 | |
| 9 | 4708 | |
| 3 | 4560 | |
| 0 | 4397 | |
| 7 | 4302 | |
| 8 | 4273 |
Latin
| Value | Count | Frequency (%) |
| F | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 60061 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 8348 | |
| - | 8070 | |
| 1 | 6864 | |
| 5 | 5615 | |
| 2 | 4916 | |
| 9 | 4708 | |
| 3 | 4560 | |
| 0 | 4397 | |
| 7 | 4302 | |
| 8 | 4273 | |
| Other values (6) | 4008 |
transaction_gt_50
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| True | |
|---|---|
| False | |
| (Missing) |
| Value | Count | Frequency (%) |
| True | 4207 | 21.1% |
| False | 3864 | 19.4% |
| (Missing) | 11855 |
transaction_day_of_week
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11855 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| 3 | |
|---|---|
| 1 | |
| 2 | |
| 4 | |
| 0 | |
| Other values (3) |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.0004956 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8075 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 1 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 5 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 1694 | 8.5% |
| 1 | 1684 | 8.5% |
| 2 | 1676 | 8.4% |
| 4 | 1489 | 7.5% |
| 0 | 1227 | 6.2% |
| 5 | 184 | 0.9% |
| 6 | 116 | 0.6% |
| False | 1 | < 0.1% |
| (Missing) | 11855 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 1694 | |
| 1 | 1684 | |
| 2 | 1676 | |
| 4 | 1489 | |
| 0 | 1227 | |
| 5 | 184 | 2.3% |
| 6 | 116 | 1.4% |
| false | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1694 | |
| 1 | 1684 | |
| 2 | 1676 | |
| 4 | 1489 | |
| 0 | 1227 | |
| 5 | 184 | 2.3% |
| 6 | 116 | 1.4% |
| F | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8070 | |
| Lowercase Letter | 4 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1694 | |
| 1 | 1684 | |
| 2 | 1676 | |
| 4 | 1489 | |
| 0 | 1227 | |
| 5 | 184 | 2.3% |
| 6 | 116 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8070 | |
| Latin | 5 | 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1694 | |
| 1 | 1684 | |
| 2 | 1676 | |
| 4 | 1489 | |
| 0 | 1227 | |
| 5 | 184 | 2.3% |
| 6 | 116 | 1.4% |
Latin
| Value | Count | Frequency (%) |
| F | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8075 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1694 | |
| 1 | 1684 | |
| 2 | 1676 | |
| 4 | 1489 | |
| 0 | 1227 | |
| 5 | 184 | 2.3% |
| 6 | 116 | 1.4% |
| F | 1 | < 0.1% |
| a | 1 | < 0.1% |
| l | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
transaction_month
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 96 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 11856 |
| Missing (%) | 59.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201678.24 |
| Minimum | 201101 |
|---|---|
| Maximum | 201812 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 155.8 KiB |
Quantile statistics
| Minimum | 201101 |
|---|---|
| 5-th percentile | 201112 |
| Q1 | 201705 |
| median | 201711 |
| Q3 | 201805 |
| 95-th percentile | 201811 |
| Maximum | 201812 |
| Range | 711 |
| Interquartile range (IQR) | 100 |
Descriptive statistics
| Standard deviation | 194.13604 |
|---|---|
| Coefficient of variation (CV) | 0.00096260281 |
| Kurtosis | 2.4483956 |
| Mean | 201678.24 |
| Median Absolute Deviation (MAD) | 93 |
| Skewness | -1.869686 |
| Sum | 1.6275434 × 109 |
| Variance | 37688.803 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 201801 | 427 | 2.1% |
| 201706 | 392 | 2.0% |
| 201805 | 384 | 1.9% |
| 201804 | 369 | 1.9% |
| 201711 | 337 | 1.7% |
| 201803 | 329 | 1.7% |
| 201708 | 319 | 1.6% |
| 201710 | 318 | 1.6% |
| 201707 | 313 | 1.6% |
| 201802 | 311 | 1.6% |
| Other values (86) | 4571 | 22.9% |
| (Missing) | 11856 |
| Value | Count | Frequency (%) |
| 201101 | 59 | |
| 201102 | 39 | |
| 201103 | 66 | |
| 201104 | 55 | |
| 201105 | 58 | |
| 201106 | 43 | |
| 201107 | 16 | 0.1% |
| 201108 | 16 | 0.1% |
| 201109 | 13 | 0.1% |
| 201110 | 10 | 0.1% |
| Value | Count | Frequency (%) |
| 201812 | 193 | |
| 201811 | 304 | |
| 201810 | 289 | |
| 201809 | 299 | |
| 201808 | 290 | |
| 201807 | 273 | |
| 201806 | 265 | |
| 201805 | 384 | |
| 201804 | 369 | |
| 201803 | 329 |
transaction_year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 11856 |
| Missing (%) | 59.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.7154 |
| Minimum | 2011 |
|---|---|
| Maximum | 2018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 155.8 KiB |
Quantile statistics
| Minimum | 2011 |
|---|---|
| 5-th percentile | 2011 |
| Q1 | 2017 |
| median | 2017 |
| Q3 | 2018 |
| 95-th percentile | 2018 |
| Maximum | 2018 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.9386167 |
|---|---|
| Coefficient of variation (CV) | 0.00096127431 |
| Kurtosis | 2.4164105 |
| Mean | 2016.7154 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.858976 |
| Sum | 16274893 |
| Variance | 3.7582346 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2018 | 3733 | 18.7% |
| 2017 | 2607 | 13.1% |
| 2011 | 410 | 2.1% |
| 2016 | 375 | 1.9% |
| 2015 | 288 | 1.4% |
| 2014 | 264 | 1.3% |
| 2012 | 255 | 1.3% |
| 2013 | 138 | 0.7% |
| (Missing) | 11856 |
| Value | Count | Frequency (%) |
| 2011 | 410 | 2.1% |
| 2012 | 255 | 1.3% |
| 2013 | 138 | 0.7% |
| 2014 | 264 | 1.3% |
| 2015 | 288 | 1.4% |
| 2016 | 375 | 1.9% |
| 2017 | 2607 | |
| 2018 | 3733 |
| Value | Count | Frequency (%) |
| 2018 | 3733 | |
| 2017 | 2607 | |
| 2016 | 375 | 1.9% |
| 2015 | 288 | 1.4% |
| 2014 | 264 | 1.3% |
| 2013 | 138 | 0.7% |
| 2012 | 255 | 1.3% |
| 2011 | 410 | 2.1% |
weekday_transaction
Boolean
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11856 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| True | |
|---|---|
| False | 300 |
| (Missing) |
| Value | Count | Frequency (%) |
| True | 7770 | |
| False | 300 | 1.5% |
| (Missing) | 11856 |
transaction_to_original_diff
Boolean
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11856 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| False | |
|---|---|
| True | 178 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 7892 | |
| True | 178 | 0.9% |
| (Missing) | 11856 |
currency_change
Boolean
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11856 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| False | |
|---|---|
| True | 178 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 7892 | |
| True | 178 | 0.9% |
| (Missing) | 11856 |
transaction_freq_gt_weekly
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11856 |
| Missing (%) | 59.5% |
| Memory size | 155.8 KiB |
| False | |
|---|---|
| F0,"Game, Toy, and Hobby Shops",CAD,P04663,2012-08-15,205.52,CAD,RECREATIONAL & EDUCATIONAL SUPPLIES,205.52,"PARKS, FORESTRY & RECREATION ",2600,2125-115,True,2,201208,2012,True,False,False,False | 1 |
Length
| Max length | 195 |
|---|---|
| Median length | 5 |
| Mean length | 5.023544 |
| Min length | 5 |
Characters and Unicode
| Total characters | 40540 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | False |
|---|---|
| 2nd row | False |
| 3rd row | False |
| 4th row | False |
| 5th row | False |
Common Values
| Value | Count | Frequency (%) |
| False | 8069 | |
| F0,"Game, Toy, and Hobby Shops",CAD,P04663,2012-08-15,205.52,CAD,RECREATIONAL & EDUCATIONAL SUPPLIES,205.52,"PARKS, FORESTRY & RECREATION ",2600,2125-115,True,2,201208,2012,True,False,False,False | 1 | < 0.1% |
| (Missing) | 11856 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| false | 8069 | |
| 2 | < 0.1% | |
| f0,"game | 1 | < 0.1% |
| toy | 1 | < 0.1% |
| and | 1 | < 0.1% |
| hobby | 1 | < 0.1% |
| shops",cad,p04663,2012-08-15,205.52,cad,recreational | 1 | < 0.1% |
| educational | 1 | < 0.1% |
| supplies,205.52,"parks | 1 | < 0.1% |
| forestry | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8075 | |
| F | 8074 | |
| a | 8074 | |
| s | 8073 | |
| l | 8072 | |
| , | 22 | 0.1% |
| 2 | 14 | < 0.1% |
| 11 | < 0.1% | |
| 0 | 11 | < 0.1% |
| A | 8 | < 0.1% |
| Other values (36) | 106 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32310 | |
| Uppercase Letter | 8140 | 20.1% |
| Decimal Number | 46 | 0.1% |
| Other Punctuation | 30 | 0.1% |
| Space Separator | 11 | < 0.1% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 8074 | |
| A | 8 | 0.1% |
| T | 7 | 0.1% |
| E | 7 | 0.1% |
| R | 7 | 0.1% |
| C | 5 | 0.1% |
| S | 5 | 0.1% |
| O | 4 | < 0.1% |
| I | 4 | < 0.1% |
| P | 4 | < 0.1% |
| Other values (8) | 15 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8075 | |
| a | 8074 | |
| s | 8073 | |
| l | 8072 | |
| o | 3 | < 0.1% |
| y | 2 | < 0.1% |
| r | 2 | < 0.1% |
| u | 2 | < 0.1% |
| b | 2 | < 0.1% |
| d | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 14 | |
| 0 | 11 | |
| 1 | 7 | |
| 5 | 7 | |
| 6 | 3 | 6.5% |
| 8 | 2 | 4.3% |
| 3 | 1 | 2.2% |
| 4 | 1 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22 | |
| " | 4 | 13.3% |
| . | 2 | 6.7% |
| & | 2 | 6.7% |
Space Separator
| Value | Count | Frequency (%) |
| 11 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40450 | |
| Common | 90 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8075 | |
| F | 8074 | |
| a | 8074 | |
| s | 8073 | |
| l | 8072 | |
| A | 8 | < 0.1% |
| T | 7 | < 0.1% |
| E | 7 | < 0.1% |
| R | 7 | < 0.1% |
| C | 5 | < 0.1% |
| Other values (22) | 48 | 0.1% |
Common
| Value | Count | Frequency (%) |
| , | 22 | |
| 2 | 14 | |
| 11 | ||
| 0 | 11 | |
| 1 | 7 | 7.8% |
| 5 | 7 | 7.8% |
| " | 4 | 4.4% |
| - | 3 | 3.3% |
| 6 | 3 | 3.3% |
| . | 2 | 2.2% |
| Other values (4) | 6 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40540 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8075 | |
| F | 8074 | |
| a | 8074 | |
| s | 8073 | |
| l | 8072 | |
| , | 22 | 0.1% |
| 2 | 14 | < 0.1% |
| 11 | < 0.1% | |
| 0 | 11 | < 0.1% |
| A | 8 | < 0.1% |
| Other values (36) | 106 | 0.3% |
| original_amount | transaction_month | transaction_year | original_currency1 | trx_currency | division | transaction_gt_50 | transaction_day_of_week | weekday_transaction | transaction_to_original_diff | currency_change | transaction_freq_gt_weekly | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| original_amount | 1.000 | 0.041 | 0.035 | 1.000 | 0.500 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| transaction_month | 0.041 | 1.000 | 0.932 | 0.000 | 1.000 | 0.439 | 0.031 | 0.049 | 0.108 | 0.025 | 0.025 | 0.000 |
| transaction_year | 0.035 | 0.932 | 1.000 | 0.000 | 1.000 | 0.439 | 0.031 | 0.049 | 0.108 | 0.025 | 0.025 | 0.000 |
| original_currency1 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.501 | 0.045 | 0.500 | 0.022 | 1.000 | 1.000 | 0.000 |
| trx_currency | 0.500 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| division | 1.000 | 0.439 | 0.439 | 0.501 | 1.000 | 1.000 | 0.013 | 0.450 | 0.108 | 0.080 | 0.080 | 0.000 |
| transaction_gt_50 | 0.000 | 0.031 | 0.031 | 0.045 | 0.000 | 0.013 | 1.000 | 0.046 | 0.040 | 0.045 | 0.045 | 0.000 |
| transaction_day_of_week | 1.000 | 0.049 | 0.049 | 0.500 | 1.000 | 0.450 | 0.046 | 1.000 | 1.000 | 0.044 | 0.044 | 0.000 |
| weekday_transaction | 1.000 | 0.108 | 0.108 | 0.022 | 1.000 | 0.108 | 0.040 | 1.000 | 1.000 | 0.024 | 0.024 | 0.000 |
| transaction_to_original_diff | 1.000 | 0.025 | 0.025 | 1.000 | 1.000 | 0.080 | 0.045 | 0.044 | 0.024 | 1.000 | 0.997 | 0.000 |
| currency_change | 1.000 | 0.025 | 0.025 | 1.000 | 1.000 | 0.080 | 0.045 | 0.044 | 0.024 | 0.997 | 1.000 | 0.000 |
| transaction_freq_gt_weekly | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
| Unnamed: 0 | purpose | merchant_name | cost_center_wbls_element_order_description | card_posting_date | merchant_type_mcc | merchant_type_description | original_currency1 | cost_center_wbls_element_order | transaction_date | transaction_amount | trx_currency | gl_account_description | original_amount | division | gl_account | batch_transaction_id | transaction_gt_50 | transaction_day_of_week | transaction_month | transaction_year | weekday_transaction | transaction_to_original_diff | currency_change | transaction_freq_gt_weekly | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1003 | Book: Canadian Urban Regions | indigo books music | HEAD OFF-POLICY&RESR | 2011-06-27 | 5399.0 | Miscellaneous General Merchandise | CAD | UR0005 | 2011-06-26 | 60.85 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 60.85 | URBAN PLANNING | 2020.0 | 1521-44 | False | 6 | 201106.0 | 2011.0 | False | False | False | False |
| 1 | 1002 | update#4 to looseleaf publication:Planning&Zoning | carswell | HEAD OFF-POLICY&RESR | 2011-06-08 | 7338.0 | Quick Copy, Reproduction, and Blueprinti | CAD | UR0005 | 2011-06-07 | 273.45 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 273.45 | URBAN PLANNING | 2020.0 | 1494-34 | True | 1 | 201106.0 | 2011.0 | True | False | False | False |
| 2 | 895 | LooseLeaf Publications updates | carswell | HEAD OFF-POLICY&RESR | 2011-05-24 | 7338.0 | Quick Copy, Reproduction, and Blueprinti | CAD | UR0005 | 2011-05-20 | 281.85 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 281.85 | URBAN PLANNING | 2020.0 | 1471-36 | True | 4 | 201105.0 | 2011.0 | True | False | False | False |
| 3 | 896 | update to looseleaf publication inv#10288391 | rei lexisnexis canada | HEAD OFF-POLICY&RESR | 2011-05-24 | 5942.0 | Book Stores | CAD | UR0005 | 2011-05-20 | 91.3 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 91.30 | URBAN PLANNING | 2020.0 | 1471-37 | False | 4 | 201105.0 | 2011.0 | True | False | False | False |
| 4 | 894 | Book: Rapid Graphs with Tableau Software 6 | createspace | HEAD OFF-POLICY&RESR | 2011-05-16 | 7829.0 | Motion Picture and Video Tape Production | USD | UR0005 | 2011-05-14 | 45.82 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 45.99 | URBAN PLANNING | 2020.0 | 1461-33 | False | 5 | 201105.0 | 2011.0 | False | True | True | False |
| 5 | 857 | update to looseleaf publication inv#7304047 | carswell | HEAD OFF-POLICY&RESR | 2011-04-15 | 7338.0 | Quick Copy, Reproduction, and Blueprinti | CAD | UR0005 | 2011-04-14 | 291.3 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 291.30 | URBAN PLANNING | 2020.0 | 1417-27 | True | 3 | 201104.0 | 2011.0 | True | False | False | False |
| 6 | 856 | Book: Everyday Ethnics for Practicing Planners | apa bookstore | HEAD OFF-POLICY&RESR | 2011-04-08 | 8299.0 | Educational Services | USD | UR0005 | 2011-04-07 | 63.11 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 63.95 | URBAN PLANNING | 2020.0 | 1406-50 | False | 3 | 201104.0 | 2011.0 | True | True | True | False |
| 7 | 854 | Book order #73422064 | abebookscom | HEAD OFF-POLICY&RESR | 2011-04-07 | 5192.0 | Books, Periodicals, and Newspapers | USD | UR0005 | 2011-04-06 | 45.07 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 46.91 | URBAN PLANNING | 2020.0 | 1404-41 | False | 2 | 201104.0 | 2011.0 | True | True | True | False |
| 8 | 1030 | Book:Inclusionary Housing In inter.Perspective | lincoln inst land plcy | HEAD OFF-POLICY&RESR | 2011-03-31 | 8299.0 | Educational Services | USD | UR0005 | 2011-03-30 | 42.74 | CAD | BOOK & MAGAZINE SUBSCRIPTIONS | 42.85 | URBAN PLANNING | 2020 | 1392-45 | False | 2 | 201103.0 | 2011.0 | True | True | True | False |
| 9 | 1029 | Book: Conference Proceedings | paypal makingcitie | HEAD OFF-POLICY&RESR | 2011-03-11 | 8999.0 | Professional Services - Not Elsewhere Cl | CAD | UR0005 | 2011-03-09 | 69.62 | CAD | CONFERENCES/SEMINARS - REGISTRATION FEES | 69.62 | URBAN PLANNING | 4256 | 1365-34 | False | 2 | 201103.0 | 2011.0 | True | False | False | False |
| Unnamed: 0 | purpose | merchant_name | cost_center_wbls_element_order_description | card_posting_date | merchant_type_mcc | merchant_type_description | original_currency1 | cost_center_wbls_element_order | transaction_date | transaction_amount | trx_currency | gl_account_description | original_amount | division | gl_account | batch_transaction_id | transaction_gt_50 | transaction_day_of_week | transaction_month | transaction_year | weekday_transaction | transaction_to_original_diff | currency_change | transaction_freq_gt_weekly | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 19916 | 331,*****,home depot 7013,SCARLETT WOODS-OPERA,2012-07-09,5200.0,Home Supply Warehouse,CAD,P07648,2012-07-06,21.38,CAD,GENERAL HARDWARE,21.38,"PARKS, FORESTRY & RECREATION ",2710,2066-19,False,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19917 | 332,DOG FOOD FOR DOG COURSE DOG,cdn tire store 00182,SCARLETT WOODS-OPERA,2012-07-10,5200.0,Home Supply Warehouse,CAD,P07648,2012-07-06,81.33,CAD,ANIMAL CARE SUPPLIES,81.33,"PARKS, FORESTRY & RECREATION ",2620,2069-12,False,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19918 | 359,ADMISSION FEES -SUMMER CAMP ACTIVITY,montgomerys inn,BLOORDALE CS-SUMR CA,2012-07-09,9399.0,Government Services - Not Elsewhere Clas,CAD,P05092,2012-07-06,103.44,CAD,TICKETS AND ADMISSION FEES,103.44,"PARKS, FORESTRY & RECREATION ",4118,2066-20,True,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19919 | 360,ADMISSION FEES -SUMMER CAMP ACTIVITY,wild water kingdom,BLOORDALE CS-SUMR CA,2012-07-10,7996.0,"Amusement Parks, Carnivals, Circuses, Fo",CAD,P05092,2012-07-06,103.0,CAD,TICKETS AND ADMISSION FEES,103.0,"PARKS, FORESTRY & RECREATION ",4118,2069-13,True,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19920 | 361,ADMISSION FEES -SUMMER CAMP ACTIVITY,wild water kingdom,BLOORDALE CS-SUMR CA,2012-07-10,7996.0,"Amusement Parks, Carnivals, Circuses, Fo",CAD,P05092,2012-07-06,1052.2,CAD,TICKETS AND ADMISSION FEES,1052.2,"PARKS, FORESTRY & RECREATION ",4118,2069-14,True,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19921 | 503,,alpine lawn garden e,TECH SERV. 1-FLEET E,2012-07-09,5261.0,Lawn and Garden Supply Stores,CAD,P00830,2012-07-06,30.5,CAD,MISCELLANEOUS PARTS,30.5,"PARKS, FORESTRY & RECREATION ",2199,2066-26,False,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19922 | 613,SUPPLIES FOR COOKING,walmart 3159,SUMMRCMP-STPHENLCKRC,2012-07-09,5411.0,"Grocery Stores, Supermarkets",CAD,P13279,2012-07-06,178.58,CAD,FOOD & NON-ALCOHOLIC BEVERAGES,178.58,"PARKS, FORESTRY & RECREATION ",2750,2066-30,True,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19923 | 614,FOOD FOR SPECIAL EVENT,walmart 3159,SUMMRCMP-STPHENLCKRC,2012-07-09,5411.0,"Grocery Stores, Supermarkets",CAD,P13279,2012-07-06,48.2,CAD,FOOD & NON-ALCOHOLIC BEVERAGES,48.2,"PARKS, FORESTRY & RECREATION ",2750,2066-31,False,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19924 | 615,CAMP SUPPLIES,walmart 3159,SUMMRCMP-STPHENLCKRC,2012-07-09,5411.0,"Grocery Stores, Supermarkets",CAD,P13279,2012-07-06,13.96,CAD,RECREATIONAL & EDUCATIONAL SUPPLIES,13.96,"PARKS, FORESTRY & RECREATION ",2600,2066-32,False,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 19925 | 616,TOYS & PRIZES FOR CAMP PARTICIPANTS,dollar store,SUMMRCMP-STPHENLCKRC,2012-07-10,5999.0,Miscellaneous and Specialty Retail Store,CAD,P13279,2012-07-06,9.61,CAD,RECREATIONAL & EDUCATIONAL SUPPLIES,9.61,"PARKS, FORESTRY & RECREATION ",2600,2069-20,False,4,201207,2012,True,False,False,False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |